Recognizing Call-center Speech Using Models Trained from Other Domains

نویسندگان

Erica Bernstein

Don McAllaster

Larry Gillick

Barbara Peskin

چکیده

In this paper, we introduce a new conversational speech task – recognizing call-center speech – using data collected from Dragon’s own technical support line. We compare performance of models trained from conversational telephone speech (the Switchboard corpus) and models trained from predominantly read, microphone speech, and report on a series of experiments focusing on adapting the microphone speech models to the telephone channel and conversational task. We also discuss the importance of task-specific language model data. We benchmark our test set by comparing the performance of our 1998 Switchboard Evaluation system to that of our simpler call-center system.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mining Call Center Conversations exhibiting Similar Affective States

Automatic detection and identifying emotions in large call center calls are essential to spot conversations that require further action. Most often statistical models generated using annotated emotional speech are used to design an emotion detection system. But annotation requires substantial amount of human intervention and cost; and may not be available for call center calls because of the in...

متن کامل

Using English Acoustic Models for Hindi Automatic Speech Recognition

Bilingual speakers of Hindi and English often mix English and Hindi together in their everyday conversations. This motivates us to build a mix language Hindi-English recognizer. For this purpose, we need well-trained English and Hindi recognizers. For training our English recognizer we have at our disposal many hours of annotated English speech data. For Hindi, however, we have very limited res...

متن کامل

A Review of Automatic Speaker Age Classification, Recognition and Identifying Speaker Emotion Using Voice Signal

Accurate gender classification is mostly convenient in case of speech and speaker recognition and also in speech emotion classification; since a superior performance has been stated when separate acoustic models are employed for males and females. Gender classification is also specious into face recognition, particular video summarization, human or robot interaction (HCI), etc. In various crimi...

متن کامل

Vector-based Natural Language Call Routing

This paper describes a domain independent, automatically trained natural language call router for directing incoming calls in a call center. Our call router directs customer calls based on their response to an open-ended “How may I direct your call?” prompt. Routing behavior is trained from a corpus of transcribed and hand-routed calls and then carried out using vectorbased information retrieva...

متن کامل

Architectures for Speech-to-Speech Translation Using Finite-state Models

Speech-to-speech translation can be approached using finite state models and several ideas borrowed from automatic speech recognition. The models can be Hidden Markov Models for the accoustic part, language models for the source language and finite state transducers for the transfer between the source and target language. A “serial architecture” would use the Hidden Markov and the language mode...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2000

Recognizing Call-center Speech Using Models Trained from Other Domains

نویسندگان

چکیده

منابع مشابه

Mining Call Center Conversations exhibiting Similar Affective States

Using English Acoustic Models for Hindi Automatic Speech Recognition

A Review of Automatic Speaker Age Classification, Recognition and Identifying Speaker Emotion Using Voice Signal

Vector-based Natural Language Call Routing

Architectures for Speech-to-Speech Translation Using Finite-state Models

عنوان ژورنال:

اشتراک گذاری